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1. Project description 

1.1. Collection and information objects 

The collection for this Information Organization System is the Library of the British Historical Society of 
North Texas located in the society office in Dallas, Texas. The library itself contains three thousand 
books on all topics relating to the governmental/political, geographic, social and religious history of Great 
Britain from the Anglo-Saxons to the House of Windsor; except travel books. Topical examples include: 
the Magna Carta, the Plantagenets, the Norman Conquest, Boudicca, The Church of England, Cardinal 
Wolsey, 100 Years War, The Black Death, Elizabeth I and II, Benjamin Disraeli, Princess Diana and the 
IRA. The library's purpose is to provide society members and visitors access to the most comprehensive 
inventory of books about the history of Great Britain in North Texas. Books are purchased by the library, 
but it does except limited donations. 

1.2. Users' demographics and knowledge 

Frequent patrons to the BHSNT Library are Advanced Placement European History Students in the Piano 
Independent School District. These are students about sixteen to eighteen years in age both male and 
female in Piano high schools. They are honors, college preparatory students from middle to upper class 
families who love learning and European history specifically and are technology proficient. Being from a 
higher socio-economic class exposes children to more advantages when it comes to education and 
personally enriching experiences. These students intend to go to college and are demonstratively high 
achievers. 

User groups demonstrate four areas of knowledge when obtaining information: general, domain, system 
and information seeking. General knowledge is intellect, taste and experience. Domain knowledge is 
proficiency within the area of study. System knowledge is understanding the components of information 
networks. Lastly, information seeking knowledge is understanding how to search. Using the scale of low, 
moderate or high, information professionals make assumptions about the AP European History students' 
knowledge. Their level of general knowledge is moderate leaning towards high. Because of their age, the 
students have limited life experiences, even though some are quite astute. Domain knowledge is the 
same moderate with a little high. The students love European history but are not as prolific in it as 
college students. Their system knowledge is moderate as is their information-seeking knowledge. 
Because of their youth, their skills are not are as honed as an adult, but a few students are prodigies. The 
students levels of knowledge and demographics determine their information behavior. Therefore, 
designing a user-centric organization system keeps in mind the specific needs and behaviors of the 
students. 



1.3. Users' problems and questions 
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The students have specific reasons for contacting the BHSNT Library. They need to obtain books on 
British History by female authors for a research paper. The students have to complete a specific task, 
while learning about a subject. Each student being highly competitive is motivated for good grades in the 
class, advanced placement college credit and personal enjoyment. The AP European History classes 
complete two mandatory field trips to the BHSNT Library, during which they have the opportunity to pose 
questions to the library staff regarding research materials. Topics for the research paper come from their 
textbooks and/or class lecture. Based upon their topics and research paper perimeters, students 
formulate questions, pose these to the librarians and gather resources for their papers. Typical questions 
include: 

User question 1 : Please help me find two or three books written by Alison Weir in the last twenty years 

which discuss Perkin Warbeck and other pretenders to the throne during the Wars of the Roses for a 

research paper? 

Object attributes: Subject, Author, Date of Publication, Character Name 

Desired precision: High 

Desired recall: Low to Moderate 

User question 2: Where do I find three or four books, over 200 pages each, hardcover and written by 
Lady Antonia Fraser on the British monarchy from the Norman Conquest to the Restoration for a paper? 
Object attributes: Setting, Book Cover, Publisher, Length 
Desired precision: High 
Desired recall: Moderate 

User question 3: Help me locate four, first edition biographies on Queen Victoria with illustrations on the 

cover, syndicated outside Great Britain and in English for my term paper? 

Object attributes: Language, Genre, Book Edition, Cover Art, Place of Publication 

Desired precision: High 

Desired recall: Moderate to High 

User question 4: I need two contemporary historical fiction books on the Tudors which have been made 

into movies. Please help me locate them? 

Object attributes: Movie Connection, Book Content 

Desired precision: High 

Desired recall: Low 

Object attributes (Subject, Author, Setting, Character Name, Book Cover, Language, Publisher, Genre, 
Book Content, Book Edition, Cover Art, Date of Publication, Place of Publication, Length, Movie 
Connection), become elements in the Metadata scheme and fields in records. Therefore the quality of 
attributes determines the efficacy of the database. The attributes represented have to correlate to the 
users questions; otherwise, the students are unable to find anything. 

2. Representation of information objects 

2.1. Entity level 

Entity, within a metadata scheme, is the functional representation of an information object. Entity is the 
material information object or an idea which the information object conveys. Entity level or unit of analysis 
is the basic form of an object that is quantifiable. For the purposes of this collection, the entity level is an 
individual book. An individual book is the appropriate entity level because users are retrieving, reading 
and analyzing the entire book. A book is one object in the collection, which becomes an entity in the 
metadata scheme and a record in the database. 

2.2. Metadata elements and semantics 
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Eleven metadata elements represent the information objects, books, in the collection within the metadata 
scheme. Elements are appropriate representations of objects for users, because elements help users 
retrieve information objects. Please see Appendix A. The elements become fields and the objects 
become records within the database. Users enter a field; retrieve a record which they use to find the 
information object. The four generic tasks, within a library cataloguing system the user performs are: find, 
identify, select and obtain. These generic tasks originate from Functional Requirements for Bibliographic 
Records (FRBR). The records which the metadata generates buttress those user tasks. Those tasks are 
instrumental in helping users survey and judge records in a catalogue. 

Find processes are those elements which help users determine whether a particular information object 
exists. Find entities include: Author, Subject and Title which are integral for users in locating records and 
users are likely to know, see Appendix A. Identify elements help users determine whether objects are 
appropriate for his search request. These elements include the attributes of Setting and Genre which 
allocate perimeters for objects. Select processes help the users compare and distinguish between object 
that meet their needs and the needs and requirements of others. Select elements include the attributes 
of Language, Publisher, Date of Publication and Length. Obtain are those records the user attains when 
he procures the desired object. These are likely not to be known to the user before retrieving the object, 
i.e. ISBN and Classification, see Appendix A. 

2.3. Record structure and specifications 

There are thirteen fields in the database record. Eleven metadata elements from the original metadata 
scheme, plus RecordID and RecordDate comport into in the database as shown in Appendix B. 
Elements become fields, once input into the database. There are thirteen fields because each element 
maps one-to-one into a single field within the database. For example, Title in the metadata scheme 
corresponds to Title in the system; Author in the metadata scheme corresponds to Author in the system 
and Length in the metadata scheme corresponds to Length in the system. The additional fields of 
RecordID and RecordDate complete the thirteen fields in the database record. They precede the other 
elements in field order and the database, Inmagic for this organization system, automatically populates 
them. RecordID provides a singular and specific record number for a database record. RecordDate is the 
date and time of record generation or revision. 

In the database, there are four types of technical specifications. The four technical specifications are 
Field Type, Indexing, Entry Validation and Content Validation. The first technical specification in the 
database is Field Type. Field Type determines the type or what kind of information goes into a field. With 
the exception of RecordID and RecordDate, whose Field Types the database generates, all other Field 
Type entries are either Text or Date. RecordID and RecordDate Field Type entries are Autoname and 
Autodate respectively, see Appendix B. Database creators determine Field Type to be Text when entry 
terms have to be in the forms of words or symbols. Field Type is Date when the entry value is a date, 
calendar or period of time. Date of Publication is the only Field Type with the entry value of Date, because 
it is a specific date. The other eleven Field Types within the system are Text because the field entries are 
words or symbols. Users input terms into those fields accordingly. 

The second technical specification in the database is Indexing. An index is list of entries which point to or 
refer to a location. There are several types of indexes. Those indexes found in the backs of books refer 
to a location within the book where information about a particular entry exists. Indexing means to make a 
field "searchable." Through indexing the user approaches the database. Indexes point to a specific 
location or area. These searchable fields or searchable locations are the access points for the database. 
Indexing options are to search by sequence or by index files. Index files or inverted files are categorized 
lists of data in a database that: point to the data record and automatically generate when new records are 
created and separate for each indexed field. Within database structure, term indexing or term index files 
are the processes of using whole phrases to search, including words or numbers. Word indexing is the 
process of searching within the database by precise words only. Word indexing within a database 
searches for a particular term only, no modifiers or any additional terms. Within a system, term indexing 
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offers opportunities for greater recall. Users will retrieve more data with term indexing because it has a 
larger scope and searches for all related terms and phrases. Word indexing within a database is more 
precise and recall is less because it is a narrower type of searching. It depends upon the demographics, 
knowledge and sophistication of the users whether database creators or indexers creators choose term or 
word indexing their system. In this system, indexing for each field including RecordID and RecordDate is 
either term or word, with the exception of ISBN which has no indexing. The Title field has term and word 
indexing. Author has term and word indexing. Length has term and word indexing. Language has term 
and word indexing. Setting has term and word indexing. Publisher has term and word indexing. Genre 
has term and word indexing. Subject and Classification also have term and word indexing. ISBN is not 
user searchable; therefore, there is no indexing application or process the database system for ISBN. 
RecordDate and Date of Publication are the only two fields with term indexing because their entry data 
are specific dates. The remaining fields with the exception of ISBN have both term and word indexing. 
Indexers of this database feel utilizing both term and word indexing gives the users the most options for 
retrieving information objects. 

The third technical specification in the database is Entry Validation. It is the process which controls or 
regulates the amount of entries within a field. Database creators determine whether terms in the Entry 
Validation fields are: required, mandatory term entry in a field; single, one term entry in a field; or unique, 
only one value is suitable for entry in a field and appears nowhere else in the record. Within the database 
system every field has Entry Validation except RecordID, RecordDate and Classification. Record ID's 
and Record Date's Entry Validation have no value because the database populates the field for them; ate. 
Classification follows different input rules, and has no Entry Validation value in the system. Back to the 
fields with Entry Validation, the Title field has single and required Entry Validation because the record 
needs one entry for the field. The Length field also has single and required Entry Validation because the 
record needs one term entry. Language field has single and required Entry Validation, for the same 
reason because the database record needs one term entry. Genre also has single and required Entry 
Validation because its record needs one term entry in the database. . Author has required Entry 
Validation because it is possible to enter more than one term in its record field, but the field has to have 
an entry. The same is true for Setting, Publisher and Subject. They have required Entry Validation 
because it is possible to enter more than one term in their field records. Single validation entry fields are 
Date of Publication and ISBN. There is only one publication year to input in into the database record. 
While ISBNs are unique to each book, they are not searchable in the database by either term or word. 
Therefore ISBN is simply single Entry Validation in the database. After 2007, books often contain two 
ISBN numbers; the original ten digit number and the newer ten digit number. In that case, use the newer 
thirteen digit number. 

The Fourth technical database specification is Content Validation. It is the process which places 
limitations on the values that are put into fields. Only authorized terms gain entry into field. Content 
Validation is optional, not all fields have it. Database creators predetermine whether the system employs 
it. Content Validation mechanisms include drop-down validation lists, ranges and masks. In a drop-down 
list, the user selects a term from a designated registry of terms. In masks, the user enters data in a 
certain format with asterisks or dashed lined as boundary markers. In ranges, terms span a sequence 
between limits. The user inputs terms within the designated region in a range. Within the database, there 
are no values listed for Content Validation for RecordID and RecordDate. For the remaining records, the 
absence or presence of Content Validation and whether the validation is a mask or drop-down list, the 
system indicates. Date of Publication and ISBN have Content Validation masks in which data is input in a 
particular format. Length has a Content Validation list where the user selects a term from a drop-down 
list. Language also has a Content Validation list where the user chooses a term from a drop-down list. 
The same is true for Setting and Genre; they have Content Validation lists where the users select a term 
to input from drop-down lists. However in the Setting field users have the option to override the Content 
Validation list, and input their own term. Title has no specified Content Validation because it is optional 
and users have the option of not entering in data into the field. Consequently, the term "None" appears in 
its Content Validation field. Similarly for Author and Publisher, they have no specified Content Validation 
in their field records because it is optional and users have the option of not entering any in the fields. 
Subject field has a thesaurus, Appendix C, a specialized controlled vocabulary which appears as a list for 
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Content Validation in the system. As for Classification, it has no value in system for Content Validation. 
Classification follows different input rules, see Appendix E. 



2.4. Record content and input rules 

For uniformity and the purposes of establishing standards across a record, a metadata scheme imports 
rules for records from database structure and specification. These rules are both for content and input. 
Content rules define what data goes into a field and prompt the locus of data from an information object. 
This is the Chief Source of Information, where data arises. Input rules are the mechanics of data entry. 
These rules maintain uniformity, congruency and exactitude across the data record so that bibliographic 
records are standard in a catalogue. 

The Chief Source of Information for most data information in the database is each individual book. On 
the front cover, back cover, book jacket, title page and last page, users find most of their desired 
information. An usual feature within the metadata scheme is the ISBN numeration. Since 2007, ISBN 
numeration is thirteen digits instead often. Input rules adjust accordingly whether for ten digit or thirteen 
digit ISBNs. 

3. Access and authority controls. 

Authority control is the process of governing data in specific fields. Authority control homogenizes 
variations, enables consistency in terminology, and distinguishes between authorized and unauthorized 
terms for the purposes of collocation in a bibliographic record. Authority control regulates data for: 
persons, places or things Three types of authority control are: subject, content validation and name 
authority control. 

Content validation mandates the selection of terms from a list of authorized terms, predetermines data 
values in a field, or restricts data values input in a field. Drop-down lists and masks are examples of 
content validation tools. In a drop-down list, the user selects a term from a designated registry of terms 
and in masks the user enters data in a certain format (Appendix B and C). 

Subject authority control is the governance of subject terms. Subject authority control derives from 
control vocabularies. Controlled vocabularies assign lists of authorized terms for subject representation 
and also show interrelationship between terms semantically. Controlled vocabularies group terms which 
are permissible according to similarity into the database and cross-references between terms more 
broad, narrow or related. A thesaurus is an example of a controlled vocabulary. A thesaurus contains a 
list of subject headings with a cross-reference system and syndetic structure. The subject field within this 
database is under the control of a thesaurus. The subject field with its sophisticated semantic term 
interrelationships, more so than the other fields in the database, makes it a suitable field for a thesaurus. 

Name authority control regulates fields with proper names. These are names of people, organizations 
and groups which need to be standardized in a type of formal canon. Name authority control standardizes 
all capitalizations, spellings, and variance in the names of persons, organizations and groups and cross- 
references them against the endorsed form. Author is the field within the database under name authority 
control. In the name authority control records for author, rules dictate which version of name, spelling and 
order of entry into the record. Any issues in establishing the authorized form of a name appear in the 
notes of the authority record in Appendix C. 



4. Representation of information content 
4.1. Subject access 
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Subject access is the point, tool or mechanism for searching a database via the content of information 
object. Subject representation informs the users about the subject matter of the object and makes 
determinations on the content. Adequate subject representation enables users to locate and retrieve their 
desired information objects. Subject analyses are the processes of information professionals, indexers, 
cataloguers through which they determine whether the object is an appropriate information resource. The 
three steps in subject analysis are familiarization (familiarity of object content), extraction (identify and 
select ideas to represent) and assignment of terms (controlled vocabulary or natural language). Subject 
analysis organizes and clarifies the content of documents so they are consistent and compatible with 
search terms. 

The subject related fields in this organization system are: Subject, Genre and Setting. Subject access for 
Subject is a thesaurus. Subject access for Setting is natural language. Subject accesses for Genre and 
Setting are controlled vocabularies. A thesaurus is an appropriate subject authority control file for the 
subject field because of the amount of and complex relationships between terms. Setting uses natural 
language because the user selects a term from the object to name the concept. Genre and Language 
utilize controlled vocabularies for subject representation, because there are certain options for authorized 
terms, such as American English, Old English or fiction and non-fiction, which represent a concept. 
These pre-selected lists of terms remove decision-making from the users and are input into the database 
record according to specific rules. The organization system creator enters the authorized term and the 
user chooses from among those listed terms. 

Classification is the systematic organization of categories according to communities, similarities or pre- 
established criteria. Life sciences frequently use classification in areas of study. Classification occurs with 
objects too. Bibliographic classification, which occurs in information science professions and libraries, 
relies upon subject analysis. An assigned number or code represents the information object within an 
organization system, based upon the subject matter. 

4.2. Thesaurus structure 

Subject authority control is the process of pre-selecting and establishing a lexicon of subject terms and 
headings. Subject authority control is a utilization process of a controlled vocabulary by users in which 
they choose authorized terms from pre-selected lists or groupings. Subject authority control is effective 
for indexers, because they use descriptors which are succinct and all-encompassing to predetermine and 
coordinate valid term associations. Subject authority control is effective for users because they rely upon 
the pre-determined and pre-coordinated term selections of indexers and do not have to contrive their own 
term; therefore, controlled vocabularies are authoritative sources for terms and valid lists from which the 
user selects terms. 

The subject field is under the subject authority control of a thesaurus within the organization system. A 
thesaurus is a type of controlled vocabulary used in grouping many terms in varying theoretical 
complexities or strata (broader to narrower). Thesauri enhance retrieval efficacy because the user selects 
only sanctioned terms determined by indexers. Syndetic structure denotes the connections between 
terms in a thesaurus. Syndetic structure, regarding semantic relationships between terms in this 
thesaurus is hierarchical, equivalent and associative. Syndetic structure includes: broad over-arching 
terms, BT; narrow more specific terms, NT; related terms, RT; and those unsanctioned terms use and use 
for USE and UF. Mandatory reciprocals are terms in which one is searchable and the other is not, USE 
and UF in the database. Also, it is relationship in which the narrower term, NT is a broad term in one 
group and the inverse is true within the syndetic structure. In the hierarchical relationships, the broad 
terms are umbrella-like terms, but they are narrower than the thesaurus domain. In the equivalent 
relationships, the terms are nearly synonymous and the structure denotes the preferred term. In the 
associative relationships, terms are related. An example of syndetic structure is as follows. Law is the 
BT, broad term. Execution is a narrow term, NT. Death Penalty is a related term, RT. A mandatory 
reciprocal relationship is one step down in which execution becomes the Broad Term, BT and the NT is 
beheading. The equivalent relationship is bastard USE illegitimate heir and illegitimate heir UF bastard. 
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Domain is the range of topics covered by the thesaurus. Scope is any limitation placed on the domain. 
Within the thesaurus in Appendix D, the domain is the British Monarchy. The scope is monarchy from 
William I to Elizabeth II. 

Specificity is the degree to which the indexed terms match the subject of a request or document. 
Specificity is high in thesaurus, Appendix D. The terms represent subjects accurately and 
unambiguously. Disciplines with distinct vocabularies, such as law, have higher specificity. 

Exhaustivity is the degree to which indexed terms match all subjects in the document. In the thesaurus, 
Appendix D exhaustivity is low. The indexer intends only a few terms to represent the subjects in the 
document. The indexer gears toward summarization with low exhaustivity levels rather than depth 
indexing. In depth indexing exhaustivity is higher because indexers attempt to extract all subjects from a 
document. Depth indexing includes main topics and subtopics. It is too advanced to the users of this 
database who less familiar with subject content. Summarization within indexing covers the dominant 
subject of a document. Exhaustivity is lower and users only need a few appropriately assigned terms to 
obtain an object. Lower exhaustivity correlates with higher recall; therefore, precision does not have to be 
high for users to retrieve their desired object. With the high precision in the user question, users within 
this database are likely to retrieve their desired objects, see Appendix D. 

4.3. Classification scheme 

Classification is the systematic organization of a collection according to communities or like classes. The 
act of classifying is the designation of an object to a specific community, group or class based upon 
similarities. Classification denotes an object's or entity's intellectual access (subject-based) and physical 
access (unique identifier). Intellectual/subject-based access is the process of conferring a 
number/code/character upon an object based upon group similarity. Physical access is the process of 
conferring a unique identifier upon an object in the form of a number/code to physically locate or identify 
the object within the organization system. In the classification code, Tud.Pur.Nov.2008/1 , one is the 
unique identifier or unique number which is also the first database record creation. Classification schemes 
and system design place similar objects close to or within approximate relationship to each other. 
Collocation is this process of arranging or positioning similar objects near one another. Collocation 
makes it easy for users to find related materials in a system and classification makes is possible for users 
to retrieve organized information within a system. 

There are two approaches to classification, hierarchical and faceted. Hierarchical approaches to 
classification are those methods in which subjects and subject relationships are prearranged in an order 
of classes and subclasses. Faceted approaches to classification are those methods in which subject 
classes are predetermined but classes and subclasses are not prearranged. Facets are classes, types of 
classes or characteristics within a classification scheme that possess common attributes. Faceted 
classification approaches analyze information objects to determine their subjects based upon these 
common attributes. Some facets are subject descriptive characteristics, such as, topic, theme, or setting. 
Other facets are physically descriptive, such as, date and format. Hierarchical classifications often appear 
in the life sciences, as in animal phyla or systems of the human body. Hierarchical classification 
approaches cover broad subjects, detail every possible concept in the system, have detailed hierarchies, 
relatively few facets, and are fairly short and uncomplicated in notation. The set of codes representing 
classes and subclasses in a classification system is notation. Because hierarchical approach 
classification notation is simple, its notation is good for collocation Faceted approaches to classification 
cover a few, broad subjects; are very flexible; easily incorporate additional facets and classes within the 
scheme; cover non-text information objects, and have synthesized notation from the fusion of different 
facets to suit the information object. 

The classification scheme in the system is a faceted approach. The advantages of a faceted approach 
are that it: works well for a few broad, non-hierarchical subjects; the scheme itself is flexible; it is easily 
expandable and accommodates the use of non-text information objects, such as dates. The classification 
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scheme within this system has a few broad, non-hierarchical subjects, is flexible and has a variety of 
different facets and synthesized notation. 

The primary facet is theme in the classification scheme. Theme allows for over-arching subject based 
intellectual access and therefore is a good primary facet choice. The remaining are in order: Author, 
Form and Date. Objects are shelved in citation order: chronologically by theme, alphabetically by theme, 
alphabetically by author, alphabetically by form and chronologically by date. Because some authors, 
such as Alison Weir, write both biographies and histories this order collocates biographies and histories 
on the same theme by the same author. A unique identifier provides a singular and specific identifying 
code (call number) for an object. The RecordID which the database automatically generates is the 
unique identifier at the end of the notation code. An example is the classification code 
Tud. Pur. Nov. 2008/1 which represents the book Vengeance Is Mine. The book has a Tudor theme (Tud), 
Brandy Purdy as the writer (Pur), and is a novel (Nov). (2008) is the publication date and it is identified in 
the first database record creation (1) thus becoming its unique identifier or unique number. Please see 
Appendix E. 

5. Name authority control 

6. System evaluation and development 

6.1. Evaluation plan 

6.2. Performance test 

The research methodology for the performance test is a basic quantitative analysis of query precision and 
recall. Research methodologies are either quantitative, which is the collection or measurement of hard 
data, such as time and costs; or research methodologies are qualitative, which is the collection of soft 
data, such as images and text. Quantitative data analyses are data measurements gathered through 
statistics; while qualitative data analyses are data measurements of intellectual content and meaning. 
Because the performance test is a measurement of systemic recall and precision, it is a quantitative 
analysis. The user is given a simple task of conducting queries or searches of the database and is asked 
to retrieve records based upon the criteria of the user questions. The test user matches the user IOP 
user group demographics. He is a first year college student, nineteen years of age, and an English major. 
He comes from an upper middle class family and north Texas school system. His general knowledge is 
moderate to high, as is the user groups' general knowledge. His domain knowledge is moderate, a little 
lower than the user group's domain knowledge. He is not a British History major or an Anglophile, unlike 
the user group who are European History enthusiasts, but his is a former Advanced Placement History 
high school student. His system knowledge probably does not exceed that of the user group. He is only 
a freshman in college and still taking General Education courses. His information seeking behavior is 
probably higher than that of the user group on average. But some individual students in the user group 
probably compare with him. The test subject is told the general perimeters of the search that he needs to 
locate records in a database using the user questions and object attributes as a guide. The objective is to 
retrieve as many relevant records as possible. He is given the users tools of the Thesaurus and Name 
Authority File because he is not as well versed in the subject matter as the user group and needs some 
cues about the format and category requirements. 

User question 1 : Please help me find two or three books written by Alison Weir in the last twenty years 

which discuss Perkin Warbeck and other pretenders to the throne during the Wars of the Roses for a 

research paper? 

Object attributes: Subject, Author, Date of Publication, Character Name 

Desired precision: High 

Desired recall: Low to Moderate 
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Query formulation: Field searched Author and term entered "Weir, Alison" 

Probable precision: High 

Probable recall: Low 

Analysis of results: The results were as expected. The actual recall was low and actual precision was 

high. The user's strategy was very successful. Two relevant records were input into the database and the 

user retrieved both records. The precision was high because the user retrieved both records from the 

database based upon the requirements of the user question. The probable recall was low. This was 

lower than the desired recall, but there were two records input in the database, not three. Recall was 

going to be low, regardless of the performance test. Therefore the performance test results were as 

expected. 

User question 2: Where do I find three or four books, over 200 pages each, hardcover and written by 

Lady Antonia Fraser on the British monarchy from the Norman Conquest to the Restoration for a paper? 

Object attributes: Setting, Book Cover, Publisher, Length 

Desired precision: High 

Desired recall: Moderate 

Query formulation 1: Field searched Author and term entered "Fraser, Lady" 

Probable precision: High 

Probable recall: Moderate 

Analysis of results: The results were as expected, no results were found. This user strategy was not 

successful. The actual recall was none and the actual precision was none. Lady Fraser was not included 

as a Variant Name in the Name Authority File. Therefore the user was not going to be able to locate any 

records in the database using the term "Fraser, Lady." However, probable recall was moderate and 

probable precision was high which correlated to desired precision and desired recall given the data 

entered into the database records. Therefore, the performance test results were as expected. 

Query formulation 2: Field searched Author and term entered "Fraser, Antonia" 

Probable precision: High 

Probable recall: Moderate 

Analysis of results: The results of this query were also as expected. This second query applied a more 

successful search strategy. The actual precision was high and the actual recall was moderate. Three 

relevant records were input into the database and the user retrieved all three records. Probable precision 

was high because the database only contained relevant records to this user question. Probable recall was 

lower than the desired recall, because the database contained fewer records than the number of objects 

requested in the user question. Therefore the performance test results were as expected. 

User question 3: Help me locate four, first edition biographies on Queen Victoria with illustrations on the 

cover, syndicated outside Great Britain and in English for my term paper? 

Object attributes: Language, Genre, Book Edition, Cover Art, Place of Publication 

Desired precision: High 

Desired recall: Moderate to High 

Query formulation: Field searched Language and term entered "English" 

Probable precision: High 

Probable recall: Moderate 

Analysis of results: The performance test results were not quite as expected. The user's search 

strategy was not as successful. Broad user search terms yield many record results. The user input 

"English" into the Language database field and all records were retrieved. System recovery exceeded 

desired and probable recall. There were and still are three relevant records in the database; however, all 

ten database records were retrieved. The actual precision was low to moderate. The actual recall was 

high. The probable precision was high given the accuracy of terms input into the database to the user 

questions. Probable recall was lower than desired recall, because there were fewer records input into the 

database than objects requested in the user question. Investigation shows that other search terms have 

to be used in tandem with Language search terms in a query to reduce the recovery of records in excess 

of recall. 
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User question 4: I need two contemporary historical fiction books on the Tudors which have been made 

into movies. Please help me locate them? 

Object attributes: Movie Connection, Book Content 

Desired precision: High 

Desired recall: Low 

Query formulation: Field searched Setting and term entered "Tudor Stuart England" 

Probable precision: Moderate 

Probable recall: Low 

Analysis of results: The performance test results were not quite as expected. The user's search 

strategy was not as successful. Entering the term Tudor Stuart England yielded more results than were 

relevant. The actual precision was low and the actual recall was high. There are four records in the 

database with the Setting of Tudor Stuart England and only two of those records have movie connections 

and are relevant. The probable precision for this query was moderate rather than the desired precision of 

high. Because the inability of the database records to fully match the user questions, the database is 

imprecise. The probable and desired recalls were both low, but the actual recall was high. There were 

two relevant records input into the database, but the recall exceeded that number. Again there is a flaw 

in the system. Investigation shows that other search terms have to be used in tandem with Setting 

search terms in a query to reduce the recovery of records in excess of recall. 

System performance based upon the "solo" performance test was average. The queries did retrieve 
results based upon the user questions, but the desired results were not always retrieved. The system 
tools, such as the database, and field records needed more specificity for accurate and precise results. 
First, the system did work. It yielded results in all queries except where the Variant Name was not 
included in the Name Authority File. The test user actually thought "Lady" was Antonia Fraser's first 
name. Secondly, when the search term was author and the author's name was input into the query there 
was a match between the desired, probable and actual precision. Recall varied because of the number of 
records input into the database varied from the number of desired objects in the user question. Those 
were the highlights of the system. What did not work harmoniously in the information retrieval process 
were the thesaurus, the database field specificity and the abilities of the test user. The user felt the 
thesaurus was too general. He stated that it needed to be more targeted and limited in scope. The user 
could not understand it and did not use it. He attempted no subject queries, because of the complexity of 
the thesaurus. Secondly, the fields in the database records were different from the object attributes, in 
some cases. Additional fields extrapolated from more pertinent metadata elements in the database 
record were needed. For example, the user kept looking for a character name field or a movie connection 
field for query searches. He had difficulty conducting the search without those fields. It is basic 
information was not provided. For more precise query searches, there possibly needed to be more 
database fields or fields with more relevance to the object attributes. Also, the skills of the test user were 
not as adept as the user group. The user's familiarity with the subject matter was not as high as the user 
group. He had difficulty in comprehending some fields. His deficiencies in domain knowledge and 
confusion in the criteria for record fields exacerbated the other defects in the system. While, he thought 
that the system was easy to use he was perplexed about some of the intricacies and details of system. 
He asked more questions than the tools he was given could answer. To the extent the performance test 
measured whether desired records could be retrieved from the system based upon the user questions, 
the test was valid. Validity is the extent a study measures or gages what it is suppose to measure or 
gage. Yes, the system performed and well when the query search term was Author. It still performed 
when the query search term was not Author, but not as accurately. The system reliability is questionable. 
Reliability is the extent to which the study produces or obtains consistent and repeatable results. 

6.3. Change and development 

From the analysis and assessment of the performance test, system performance and user satisfaction, 
there are improvements to be made in the information organization system. User satisfaction was low 
with the information system. Not that any monetary costs or opportunity costs were incurred by the test 
user, but that time and quality were adequate. There was additional time used because the query for 
user question two was repeated and there was a significant amount of time spent in understanding the 
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system intricacies, tools and structure. The user spent over fifteen minutes reading and rereading the 
thesaurus and staring at the display screen. There was no extensive wait time or lag in retrieving the 
query results, just in the user comprehending the search itself. Also, the quality of the system was 
average to the user. Quality is subjective, but the user expressed dissatisfaction with the thesaurus and 
consistency of material. Recovery exceeded recall on two queries, and user expressed that more 
consistent object attributes with database fields would be beneficial. More harmony from the user 
questions to object attributes to the metadata elements and database fields would reduce unnecessary 
recovery and make a more effective information retrieval system. The user felt the thesaurus was too 
broad and irrelevant. Improvements to its design are discussed in the subsequent paragraphs. The user 
did indicate the system worked well and if he knew more about the topic, he would have performed better. 
The novelty of the output, the newness of the system, does effect user satisfaction and should factor into 
system evaluation. However, the novelty of the subject matter should not be a factor, as the user group 
should be well versed in the subject. The performance test user was not ideal and as astute in the subject 
material as the system user group. 

There are other system weaknesses that affect system performance and information retrieval. The 
deficiencies are: in the user group, technological structure and information environment and this 
evaluation addresses each. First, the age demographic of the user group should be raised. The user 
group needs to have more domain knowledge than a high school AP European History student. Perhaps 
college British or European history students or political science students would be more proficient user 
groups. The difficulties of the performance test user to master the system were compounded by his 
unfamiliarity with the subject matter. The user group needs to have more than a general or working 
knowledge of the material. The system could still work for AP European History students but they have 
more than casual knowledge of the material. 

Technological structure improvements include enhancing the thesaurus and increasing the database 
fields. The performance test showed the flaws in both of the structures. There are deficiencies in the 
semantics of the thesaurus. At present the thesaurus is too broad and immaterial to the user questions. It 
should be revised with more consistent terms to the user questions and made more understandable. 
Currently, it is geared toward a user who is very familiar with the subject matter that needs little direction. 
A more targeted, limited in scope and user friendly thesaurus with simpler terms more consistent with the 
user questions would be an improvement. If there was more time, the system designer could revise the 
thesaurus and make all the necessary adjustments. The other structures which need improvement are 
the database fields. There are not enough fields or they are not as specialized as they should be for 
more successful information retrieval. The only field which worked successfully in the user performance 
test was Author where the Author's name was mentioned in the user question. The additional fields of 
Character Name and Movie Connection definitely should be added into the database and perhaps more 
fields which are more consistent with the user questions and object attributes should be considered. .An 
unforeseen defect or problem was the Name Authority File. There should have been more VariantNames 
forms included. The performance test revealed that all possibilities for unauthorized names were not 
accounted within the system. Therefore, the file has to be expanded and more forms included. With more 
time and foresight these few enhancements could be made to make the information system retrieval more 
successful. 

Within the information environment, changes need to be made. The query interface was intimidating for 
the performance test user. He asked many questions which could not be answered and wanted help. A 
more user friendly environment would be helpful. An environment with cues and prompts that could guide 
queries and searches, so that the user is not so alone and has aids or "help" in conducting the searches 
would be beneficial. More interactive, user-friendly and approachable interface would appeal to the 
information technology savvy database users and to the system designers as well. Database designers 
and creators direct system structure. More appealing features for database creators will in turn lead to 
more appealing features for the database users. System designers would invest more system creativity if 
the information environment were more stimulating. Somehow, the environment needs an overhaul. 
More expensive software systems and packages might contain these enhanced components. Spending 
more money on "deluxe " software programs, could improve the information environment from the 
system designer's and system user's prospective and make for more successful information retrieval. 
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The entire system needs to be less complicated. While more time would improve some system features 
and more money could improve others. The results of the performance test were that the search process 
was perplexing and inconsistent. There was not enough matching or consistency between the user 
questions, object attributes, database fields and thesaurus. Consequently, recovery exceeded recall for 
two queries. While the information system was not a complete failure, information retrieval was not as 
efficacious as it could or should be. The modifications mentioned in the previous paragraphs could 
enhance the system or perhaps starting over would be better. More studies and tests with the same user 
group or slightly older but incorporating the technical and information environmental changes are 
necessary. Long term surveys would highlight unnecessary and dysfunctional system features and 
reveal which components routinely work. It is difficult to make a statistically accurate assessment of 
information retrieval system performance after one performance test. True system efficacy, usefulness of 
Information Retrieval results, and efficiency, costs and speed, comes from extensive cost-benefit and 
cost-effective analysis and ongoing and independent testing. This is just one system evaluation which 
suggests changes to enhance system development, there needs to be further testing and further 
evaluation to see which components regularly work and which do not. 

7. Project summary 

Wow, I had no idea when I first selected my collection for the IOP, where the process would lead. It has 
been a gauntlet of blood, sweat and tears, quite a few tears at times. I have learned much and would like 
to share a few of my thoughts. If I had to do it all over again, I would choose the same collection but I 
would make some drastic changes to my system. I love European history, vacillated between being a 
Russian History and International Political Science major in college. Alison Weir is my favorite author and 
she has a new book coming this fall on the Houses of Lancaster and York. Therefore, I would not have 
changed the collection. I would have changed a couple of user questions, but I had such difficulty 
formulating my user questions that I felt obligated to them. I did not think that they were so challenging, 
but given the obstacles my user faced maybe the questions should have been more general. I do know 
that the object attributes did not translate well into the metadata elements; consequently, there was 
disconnect between the fields in the records and object attributes. I would have to add more elements 
and more field to encapsulate all of the object attributes. I did not foresee any problems with number of 
fields or their semantics before the performance test. I am so attached to the subject matter that I could 
not think objectively. I was completely oblivious until the test and the user asked for the Character Name 
field. At that moment, a light bulb went off and I realized that I should have included one. Also, I knew 
there were some problems with the thesaurus, but it probably should be totally revised. Again I think it 
needs to be more consistent with the user questions. It is not that it is difficult to use; it seemed to be 
more cumbersome to the user during the queries. I would have to take much more time to think about 
how I would approach constructing a new one. I honestly did not know what I was doing, until I started 
working on it. Again, another light bulb flashed in my head and I realized what I was supposed to be 
creating. Knowing what I know now, I could construct a better thesaurus; I know that I would need more 
time than the class permits to do it. 

I am not sure whether my system is better than a traditional system or whether or not it is likely to be 
implemented. I think that my system has some merit and could be modified to make it more feasible. My 
system is not that complex or complicated. It definitely helps, if users have more than a general know of 
the subject matter. I could see my system implemented in a British Historical society or a British 
Monarchist Guild or Tudor-Stuart Book club. I would think that the system has to find a niche. To make 
the organization system more general, I would have to change the demographic, the user questions and 
comprise an entirely new "organism." I was not my goal and I am not sure that would want to undertake 
that endeavor. If I had to make a more general system for average high school students, I could to it. I 
just did not envision this system for those types of students or for a generic purpose. 

I did learn more than I could have imagined from this class. Even though I am a technical writer, I have 
not worked with any metadata; in fact I have yet to work in library. My background is Public Health and 
International Political Science and soon I hope to be working in a medical library. I feel that I have gained 
more insight into creating information databases and how they work. I never thought about the inner 
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workings of an information organization system before this class. I do understand now how the layers 
and components build upon one another and how each needs to be functionally sound. The only desire 
which I could express would be to have more time to construct the organization system. I did not fully 
comprehend the endeavor until a month into the class. I know that is not possible with the time 
constraints of the class, to allot more time on certain sections. It also is difficult to omit certain sections 
because all of them integral to system performance. For example, deciding to give the user in the 
performance test the Name Authority File when it is not discussed in the project was a difficult judgment 
call for me. I felt that user needed it. He probably needed copies of the book in his hand, but that is 
subject for the Texas Public System. Nonetheless, having omitted Name Authority Control from the IOP, 
should it be included in the queries. Furthermore what happens to the system when components are 
omitted? These are tough decisions when more time is allocated to certain sections and other IOP 
sections are discarded. I am not sure what the best approach is. I know this system was my best effort, 
given the knowledge which I had at the moment of its inception. I would produce a more furtive and 
successful information retrieval system today, but that is only because I learned what I did in this class. 
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Appendix A. Metadata elements and semantics 



No. 


Element name 


Semantics 


1 


Title 


The name of the work 


2 


Author 


The writer of the work 


3 


Length 


The amount of content between the beginning and the end of the work 


4 


Date of Publication 


The date which the syndication of the work occurs 


5 


Language 


The formal system of words written or spoken in the work 


6 


Setting 


The time and place in which the action of the work occurs 


7 


Publisher 


The syndicator of the work 


8 


ISBN 


International Standard Book Number 


9 


Genre 


Literary form of a particular work 


10 


Subject 


Area of interest of a work 


11 


Classification 


Systematic organization of a collection according to communities 
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Appendix B. Record structure and specifications 
1. Record structure specifications 



No. 


Field name 


Field type 


Indexing 


Entry validation 


Content 
validation 


1 


RecordID 


Autonumber 


T, W 


None 


None 


2 


RecordDate 


Autodate 


T 


None 


None 


3 


Title 


Text 


T, W 


S, R 


None 


4 


Author 


Text 


T, W 


R 


None 


5 


Length 


Text 


T, W 


S. R 


List 


6 


Date of Publication 


Date 


T 


S 


Mask 


7 


Language 


Text 


T,W 


S, R 


List 


8 


Setting 


Text 


T, W 


R 


List 


9 


Publisher 


Text 


T, W 


R 


None 


10 


ISBN 


Text 


None 


S 


Mask 


11 


Genre 


Text 


T, W 


S, R 


List 


12 


Subject 


Text 


T, W 


R 


List 


13 


Classification 


Text 


T, W 


None 


None 



Key 

T= Term Indexing 

W= Word Indexing 

S= Single Entry Validation 

R= Required Entry Validation 

U= Unique Entry Validation 

2. Textbase structure 

Textbase Structure 



List= Content Validation List 
Mask = Content Validation Mask 



Textbase: C:\Documents and Settings\Owner\My Documents\appendixtextbase 
Created: 7/12/2009 12:12:47 PM 
Modified: 7/12/2009 12:12:47 PM 



Field Summary: 

1. RecordID: Automatic Number(next avail=l, increm=l), Term & Word 

2. RecordDate: Automatic Date(Both Date and Time, When Created), Term 

3. Title: Text, Term & Word 

Validation: required, single-only 

4. Author: Text, Term & Word 

Validation: required 

5. Length: Text, Term & Word 

Validation: required, single-only, valid-list 

6. Date of Publication: Date, Term 

Validation: single-only, mask **** 

7. Language: Text, Term & Word 

Validation: required, single-only, valid-list 

8. Setting: Text, Term & Word 

Validation: required, valid-list, update-valid-list, override 

9. Publisher: Text, Term & Word 

Validation: required 

10. ISBN: Text 

Validation: single-only, mask ***-*-***-*****-* 

11. Genre: Text, Term & Word 

Validation: required, single-only, valid-list 

12. Subject: Text, Term & Word 
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Validation: required, valid-list, update-valid-list, override 
13. Classification: Text, Term & Word 

Log file enabled, showing 'RecordlD' 

Leading articles: a an the 

Stop words: a an and by for from in of the to 

Textbase Defaults: 

Default indexing mode: SHARED IMMEDIATE 
Default sort order: <none> 
Textbase passwords: 
Master password = ' ' 
Access passwords: 
No silent password 
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Appendix C. Record content and input rules 

Field Name: RecordID 

Semantics: Specific and singular record identification number 

Chief Source of Information: Automatically input by database 

Input Rules: Automatically input by database 

Example: 1 

Field Name: RecordDate 

Semantics: Date and time of record generation and revision 

Chief Source of Information: Automatically input by database 

Input Rules: Automatically input by database 

Example: 06/24/2009 20:40:30 

Field Name: Title 

Semantics: Name of the book 

Chief Source of Information: Title page of the book 

Input Rules: Enter and spell title exactly as found on book. Use sentence style capitalization. 

Example: The Life and Times of Victoria 

Field Name: Author 

Semantics: The writer of the book 

Chief Source of Information: Title page of the book 

Input Rules: Use authorized form of name: Last name, First name. If book has no author, input 

"unknown" in the record. 

Example: Weir, Alison 

Field Name: Length 

Semantics: The amount of pages between the front cover and the back cover of the book. 

Chief Source of Information: The book's pagination 

Input Rules: Less than 150 pages is brief; 151 pages to 300 pages is moderate; 301 pages and above 

is long. Select format in validation list. Press F3. 

Example: 544 pages is a long book 

Field Name: Date of Publication 

Semantics: The calendar date in which the syndication of the book occurs 

Chief Source of Information: Title page of book 

Input Rules: Enter field as four digit year in validation mask. Press F3. 

Example: 1989 

Field Name: Language 

Semantics: The formal system of words in the book 

Chief Source of Information: The physical book itself 

Input Rules: Select the desired language of the book from terms listed in a validation list. Press F3. 

Example: English 

Field Name: Setting 

Semantics: The time and place in which the action of the book occurs. 

Chief Source of Information: Front cover, back cover or inside book jacket 
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Input Rules: Select desired setting from list of terms in a validation list. Press F3, If desired setting not 
found in list, user may input his own term. 
Example: Victorian England, Tudor-Stuart England 

Field Name: Publisher 

Semantics: The syndicator of the book 

Chief Source of Information: Title page of the book 

Input Rules: Enter official company name as it appears in the book. 

Example: Oxford University Press, iUniverse Inc. 

Field Name: ISBN 

Semantics: International Standard Book Number 

Chief Source of Information: Back cover of the book 

Input Rules: Enter numeric digits in a ten or thirteen number sequence according to the validation list 

mask format. Published books before 2007 have ten digit ISBN's while published books after 2007 have 

a thirteen digit ISBN. Leave the three digit prefix blank in mask for those published books before 2007. 

Example: 978-0-521-40677-2 

Field Name: Genre 

Semantics: Literary form of a particular type of work 

Chief Source of Information: The book front cover, back cover or inside jacket 

Input Rules: Select literary form from a list of terms in a validation list as indicated. Press F3. 

Example: Biography, Historical Novel 

Field Name: Subject 

Semantics: Area of interest of the work 

Chief Source of Information: Table of Contents 

Input Rules: Accept authorized terms from thesaurus, Appendix D. Index terms as specifically and 

exhaustively as possible. If idea or term is not in the thesaurus, add it. Capitalize Proper Nouns, 

otherwise use lowercase letters and symbols. 

Example: Prince of Wales, law 

Field Name: Classification 

Semantics: Systematic organization of a collection according to communities. 

Chief Source of Information: See Appendix E 

Input Rules: See Appendix E 

Example: Han.Eri.Bio.2002/10 
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Appendix D. Sample Thesaurus 

Key at bottom of Thesaurus 

Adultery 

UF Infidelity NT Black Death 

RT Marriage NT Bubonic Plague 

RT Matrimony NT Plague 

Anglican Execution 

RT Church of England NT Beheading 

BT Religion RT Death Penalty 

BTLaw 
Aristocracy 

UF Gentry Favor USE Patronage 

RT Royalty 

Gentry USE Aristocracy 
Bastard USE Illegitimate Heir 

Illegitimate Heir 
Beheading UF Bastard 

BT Death Penalty RT Children 

BT Execution RT Issue 

Black Death Infidelity USE Adultery 

RT Bubonic Plague 

BT Epidemic Issue 

BT Plague RT Children 

RT Illegitimate Heir 
Bubonic Plague RTWard 

RT Black Death 

BT Epidemic Law 

BT Plague NT Death Penalty 

NT Execution 
Catholic NT Succession 

BT Religion 

Law of Primogeniture 
Children RT Salic Law 

RT Illegitimate Heir BT Succession 

RT Issue 
RT Ward Marriage 

RT Adultery 
Crown Prince RT Matrimony 

RT Prince of Wales 

Matrimony 
Church of England RT Adultery 

RT Anglican RT Marriage 

BT Religion 

Patronage 
Death Penalty UF Favor 

NT Beheading 

RT Execution Plague 

BT Law NT Black Death 

NT Bubonic Plague 
Epidemic BT Epidemic 
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Prince of Wales 

RT Crowned Prince 

Protestant 

BT Religion 

Religion 

NT Anglican 

NT Catholic 

NT Church of England 

NT Protestant 

Royal Absolutism 

RT The Divine Right of Kings 

Royalty 

RT Aristocracy 

Salic Law 

RT Law of Primogeniture 
BT Succession 

Succession 

NT Law of Primogeniture 
NT Salic Law 

The Divine Right of Kings 

RT Royal Absolutism 

War 

NT War of the Roses 

War of the Roses 

BTWar 



Key: 

BT Broad Term 
NT Narrow Term 
RT Related Term 
UF Used For 
USE Use 
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Appendix E. Classification scheme 
1. Scheme 





A 
Theme 


B 
Author 


C 
Form 


D 
Date 


1 


General (Gen) 


(see rule) 


Biography (Bio) 


(see rule) 


2 


Norman (Nor) 




History (His) 




3 


Plantagenet (Pla) 




Novel (Nov) 




4 


Tudor (Tud) 








5 


Stuart (Stu) 








6 


Hanoverian (Han) 








7 


Windsor (Win) 









2. Notation rules 

Facet name: Theme 

Chief source of information: Title page or Table of Contents 

Notation rules: Use abbreviations as displayed in Table. Classify as General, if no dynastic theme is 

evident. Add a theme, if none listed for the object. Follow class codes with a period. 

Facet name: Author 

Chief source of information: Title page 

Notation rules: Use first three letters of author's last name. Capitalize the first letter. If multiple authors, 

use first author only. If object has not author, omit it. End with a period. 

Facet name: Form 

Chief source of information: Book Cover, title page or text 

Notation rules: Use abbreviation exactly as displayed in Table. If object is a format not listed, add a 

class for it. Follow with a period. 

Facet name: Date 

Chief source of information: Title page 

Notation rules: Use four digit year of publication. If no year is published, omit facet. End with a slash. 



3. Rule for unique number 

Appendix RecordID number from database until the end of notation. Do not punctuate. 

4. Example: Tud.Pur.Nov.2008/1 

The classification code Tud. Pur. Nov. 2008/1 signifies the book Vengeance Is Mine, which has a Tudor 
theme (Tud), has Brandy Purdy as its writer (Pur) and is a novel (Nov). (2008) is the publication date and 
it is identified in the first database record creation (1) making it the unique identifier. 



Appendix F. Name authority file 
1. Record structure specifications 
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No. 


Field name 


Field type 


Indexing 


Entry validation 


1 


RecordID 


Autonumber 




— 


2 


RecordDate 


Autodate 




— 


3 


AuthorizedName 








4 


VariantNames 








5 


SourcesUsed 









2. Textbase structure (See Inmagic Tutorial in online course content for what goes here) 



3. Record content and input rules 

1. Field name: 
Semantics: 
Input rules: 
Example: 

4. Sample records 
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Appendix G. Sample records 
RecordID: 1 

RecordDate: 7/12/2009 12:51:32 

Title: Vengeance Is Mine 

Author: Purdy, Brandy 

Length: moderate (151-300 pages) 

Date of Publication: 2008 

Language: English 

Setting: Tudor Stuart England 

Publisher: iUniverse, Inc. 

ISBN: 978-0-595-48124-8 

Genre: Historical Novel 

Subject: adultery, Church of England, beheading, death penalty, Protestant 

Classification: Tud.Pur.Nov.2008/1 

RecordID: 2 

RecordDate: 7/12/2009 13:38:03 

Title: The Lives of the Kings and Queens of England 

Author: Fraser, Antonia 

Length: long (300 pages or more) 

Date of Publication: 2000 

Language: English 

Setting: England/Great Britain 

Publisher: University of California Press 

ISBN: ***-0-520-22460-4 

Genre: Biography 

Subject: succession, The Divine of Right of Kings, war, Black Death, Bubonic Plague, Law of 

Primogeniture, religion 

Classification: Gen.Fra.Bio.2000/2 

RecordID: 3 

RecordDate: 7/12/2009 13:46:32 

Title: James, VI of Scotland, I of England 

Author: Fraser, Antonia 

Length: moderate (151-300 pages) 

Date of Publication: 1994 

Language: English 

Setting: Tudor Stuart England 

Publisher: George Weidenfeld & Nicholson 

ISBN: ***-0-297-8331 7-0 

Genre: Biography 

Subject: succession, Prince of Wales, war, Catholic, Protestant 

Classification: Stu.Fra. Bio. 1994/3 

RecordID: 4 

RecordDate: 7/12/2009 13:50:10 

Title: Charles II 

Author: Fraser, Antonia 

Length: long (300 pages or more) 

Date of Publication: 2007 

Language: English 

Setting: Tudor Stuart England 

Publisher: Phoenix 

ISBN: 978-0-753-81403-1 

Genre: Biography 

Subject: illegitimate heir, adultery, Catholic, religion, patronage, Protestant, Church of England 
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Classification: Stu.Fra.Bio.2007/4 

RecordID: 5 

RecordDate: 7/12/2009 13:54:54 

Title: The Wars of the Roses 

Author: Weir, Alison 

Length long (300 pages or more) 

Date of Publication: 1995 

Language: English 

Setting: Plantagenet England 

Publisher: Ballentine 

ISBN: ***-0-345-391 17-9 

Genre: Historical Novel 

Subject: war, War of the Roses, marriage, issue, aristocracy, royal absolutism, crown prince 

Classification: Pla.Wei.His.1 995/5 

RecordID:: 6 

RecordDate: 7/12/2009 14:00:08 

Title: The Life and Times of Queen Victoria 

Author: Marshall, Dorothy 

Length: moderate (151-300 pages) 

Date of Publication: 1995 

Language: English 

Setting: Victorian England 

Publisher: Shooting Star Press 

ISBN: ***-1 -558-59450-7 

Genre: Biography 

Subject: royalty, children, Prince of Wales, Anglican, epidemic 

Classification: Han. Mar. Bio. 1995/6 

RecordID: 7 

RecordDate: 7/12/2009 14:20:29 

Title: The Other Boleyn Girl 

Author: Gregory, Philippa 

Length: long (300 pages or more) 

Date of Publication: 2008 

Language: English 

Setting: Tudor Stuart England 

Publisher: Touchstone 

ISBN: 978-1-416-56060-9 

Genre: Historical Novel 

Subject: succession, execution, beheading, crown prince, royal absolutism, religion 

Classification: Tud.Gre.Nov.2008/7 

RecordID: 8 

RecordDate: 7/12/2009 14:30:08 

Title: The Princes in the Tower 

Author: Weir, Alison 

Length: moderate (151-300 pages) 

Date of Publication: 1995 

Language: English 

Setting: Plantagenet England 

Publisher: Ballentine 

ISBN: ***-0-345-391 78-0 

Genre: Historical Novel 

Subject: War of the Roses, succession, marriage, law, war, execution 
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Classification: Pla.Wei. His. Bio/1 995 

RecordID: 9 

RecordDate: 7/12/2009 14:39:41 

Title: Victoria Victorious 

Author: Palady, Jean 

Length: long (300 pages or more) 

Date of Publication: 2005 

Language: English 

Setting: Victorian Great Britain 

Publisher: 3 Rivers Press 

ISBN: ***-0-609-81 024-3 

Genre: Biography 

Subject: marriage, matrimony, issue, epidemic, The Divine Right of Kings 

Classification: Han.Pal.Bio.2005/9 

RecordID: 10 

RecordDate: 7/12/2009 14:46:58 

Title: Her Little Majesty 

Author: Erickson, Carolly 

Length: long (300 pages or more) 

Date of Publication: 2002 

Language English 

Setting: Victorian England 

Publisher: Simon & Schuster 

ISBN: ***-0-743-23657-2 

Genre: Biography 

Subject: children, Salic Law, issue, aristocracy, matrimony, Prince of Wales 

Classification: Han.Eri.Bio.2002/10 



